Analysing fundamental frequency contours and local speech rate in map task dialogs

Authors

  • Hansjörg Mixdorff
  • Hartmut R. Pfitzinger
Abstract

The current paper reports the first results from the analysis of task-oriented dialogs using a Fujisaki model-based parameterization of F0 contours, as well as a model of the perceptual local speech rate. Two versions of map task style dialogs were examined: (1) the recordings made during the map task proper, and (2) readings from scripts of the original dialogs by the same subjects. The first part of this paper presents an analysis of phrase boundaries with respect to form and function. A second issue is the problem of processing fillers, hesitations and repairs within the framework of the Fujisaki model-based analysis. The second part of the paper describes the comparative analysis of spontaneous and read versions of the same dialog fragments with respect to Fujisaki model parameters, contours of the perceptual local speech rate, and other features. In a perception test we asked listeners to identify the speaking style of dialog fragments. Apparently this was possible only for part of the data. Analysis of accent commands and perceptual local speech rate contours still suggested differences between the two speaking styles. The number of accented syllables, the associated accent command amplitudes, and the perceptual local speech rate were generally higher in the read than in the spontaneous utterances. These results were almost significant despite the fact that the read version had been well re-enacted by the subjects and therefore did not exactly exhibit typical reading style characteristics. Despite this drawback, the methodology presented here has strong potential for further comparative prosodic studies of speaking styles. © 2005 Published by Elsevier B.V.

Similar resources

Quantitative Analysis of Prosody in Task-Oriented Dialogs

The current paper reports first results from the analysis of task-oriented dialogs using a Fujisaki model-based parameterization of F0 contours. Two versions of map task style dialogs were examined: (1) the recordings made during the map task proper, (2) readings from scripts of the original dialog by the same speakers. In the scope of this paper an analysis of phrase boundaries with respect to...

Identification and automatic generation of prosodic contours for a text-to-speech synthesis system in French

This paper presents the realisation of an automatically trainable computational prosodic model for French Text-to-Speech Synthesis. The methodology proposes the construction of the model in two steps. The first step consists in predicting fundamental frequency contours and duration of syllables from abstract prosodic markers using neural networks [17,12]. In this step, the abstract prosodic mark...

ICASSP '92. DP-Based Determination of F0 Contours from Speech Signals

A new algorithm for the determination of fundamental frequency (F0) contours is presented. For each voiced frame appropriate divisors of the frequency with the maximum energy in the spectrum are taken as F0 candidates. An F0 contour is computed using a dynamic programming (DP) method by minimizing a weighted sum of the difference between consecutive candidates and the distance of the candidates ...

Prosodic word boundary detection using mora transition modeling of fundamental frequency contours -speaker independent experiments-

We have been developing a reliable method for prosodic word boundary detection for Japanese continuous speech based on the discrete hidden Markov modeling of fundamental frequency (F0) contours in mora units. Although a favorable result was obtained for the ATR continuous speech corpus as reported already, experiments were done only on closed conditions. This paper reports the results on open and ...

Nearly perfect detection of continuous f_0 contour and frame classification for TTS synthesis

We present a new method for the estimation of a continuous fundamental frequency (F0) contour. The algorithm implements a global optimization and yields virtually error-free F0 contours for high quality speech signals. Such F0 contours are subsequently used to extract a continuous fundamental wave. Some local properties of this wave, together with a number of other speech features allow to clas...

Journal:
  • Speech Communication

Volume 46

Publication date: 2005